Batch Evaluation Metrics in Information Retrieval: Measures, Scales, and Meaning
نویسندگان
چکیده
A sequence of recent papers, including in this journal, has considered the role measurement scales information retrieval (IR) experimentation, and presented argument that (only) uniform-step interval should be used. Hence, it been argued, well-known metrics such as reciprocal rank, expected normalized discounted cumulative gain, average precision, either discarded tools, or adapted so their metric values lie at uniformly-spaced points on number line. These papers paint a rather bleak picture past decades IR evaluation, odds with community’s overall emphasis practical experimentation measurable improvement. Our purpose work is to challenge pessimistic assessment. In particular, we argue mappings from categorical ordinal data sets line are valid provided there an external reason for each target point have selected. We first consider general scales, categorical, ordinal, interval, ratio, absolute collections. connection two those categories also provide examples knowledge captured represented by numeric real Focusing then retrieval, document rankings data, effectiveness single value summarizes usefulness user population users any given ranking, able continuous variable ratio scale. That is, most current well-founded, and, moreover, more meaningful form than proposed “intervalized” versions.
منابع مشابه
Correlation and Prediction of Evaluation Metrics in Information Retrieval
Because researchers typically do not have the time or space to present more than a few evaluation metrics in any published study, it can be difficult to assess relative effectiveness of prior methods for unreported metrics when baselining a new method or conducting a systematic meta-review. While sharing of study data would help alleviate this, recent attempts to encourage consistent sharing ha...
متن کاملMeaning in philosophy and meaning in information retrieval (IR)
Purpose -The paper explores the question of whether the differences between meaning in philosophy and meaning in information retrieval (IR) have implications for the use of philosophy in supporting research in IR. Design/methodology/approach Conceptual analysis and literature review. Findings There are some differences in the role of meaning in terms of purpose, content and use which should be ...
متن کاملRanking Metrics and Evaluation Measures
In this work, we present a general guideline to establish the relation between a distribution model and its corresponding similarity estimation. A rich set of distance metrics, such as Harmonic distance and Geometric distance, is derived according to Maximum Likelihood theory. These metrics can provide a more accurate model than the conventional Euclidean distance and Manhattan distance. Becaus...
متن کاملTwo Axioms for Evaluation Measures in Information Retrieval
In this paper evaluation measures for information retrieval system outputs are investigated from a measurement theoretic point of view. Two axioms are introduced: the axiom of monotonicity and the Archimedian axiom. It is shown that the measures fullfilling these axioms are exactly the measures equivalent to some measure of the form ~a + ~d where a is the number of relevant retrieved documents ...
متن کاملMeaning-Focused and Quantum-Inspired Information Retrieval
8:30 am 9:00 am Coffee break 9:00 am 9:30 am Conference opening and introduction Derek Raine (Physics); Peter Jackson and Emmanuel Haven (Management) 9:30 am 10:30 am Plenary Talk Professor Edward Nelson Department of Mathematics Princeton University Title of Talk: Stochastic mechanics of particles and fields 10:30 am 11:00am Coffee break 11:00 am 12:30 pm Paper session I.: Meaning. Session Cha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3211668